Large Scale Similar Song Retrieval using Beat-aligned Chroma Patch Codebook with Location Verification
نویسندگان
چکیده
With the popularity of song search applications on Internet and mobile phone, large scale similar song search has been attracting more and more attention in recent years. Similar songs are created by altering the volume levels, timing, amplification, or layering other songs on top of an original song. Given the large scale of songs uploaded on the Internet, it is demanding but challenging to identify these similar songs in a timely manner. Recently, some state-of-the-art large scale music retrieval approaches represent songs with a bag of audio words by quantizing local features, such as beat-chroma patches, solely in the feature space. However, feature quantization reduces the discriminative power of local features, which causes many false audio words matches. In addition, the location clues among audio words in a song is usually ignored or exploited for full location verification, which is computationally expensive. In this paper, we focus on similar song retrieval, and propose to utilize beat-aligned chroma patches for large scale similar song retrieval and apply location coding scheme to encode the location relationships among beat-aligned chroma patches in a song. Our approach is both efficient and effective to discover true matches of beat chroma patches between songs with low computational cost. Experiments in similar songs search on a large song database reveal the promising results of our approach.
منابع مشابه
Clustering Beat-Chroma Patterns in a Large Music Database
A musical style or genre implies a set of common conventions and patterns combined and deployed in different ways to make individual musical pieces; for instance, most would agree that contemporary pop music is assembled from a relatively small palette of harmonic and melodic patterns. The purpose of this paper is to use a database of tens of thousands of songs in combination with a compact rep...
متن کاملData Driven and Discriminative Projections for Large-Scale Cover Song Identification
The predominant approach to computing document similarity in web scale applications proceeds by encoding taskspecific invariance in a vectorized representation, such that the relationship between items can be computed efficiently by a simple scoring function, e.g. Euclidean distance. Here, we improve upon previous work in large-scale cover song identification by using data-driven projections at...
متن کاملLarge-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude
Large-scale cover song recognition involves calculating itemto-item similarities that can accommodate differences in timing and tempo, rendering simple Euclidean measures unsuitable. Expensive solutions such as dynamic time warping do not scale to million of instances, making them inappropriate for commercial-scale applications. In this work, we transform a beat-synchronous chroma matrix with a...
متن کاملIdentifying ‘Cover Songs’ with Beat-Synchronous Chroma Features
Large music collections, ranging from thousands to millions of tracks, are unsuited to manual searching, motivating the development of automatic search methods. When two musical groups perform the same underlying song or piece, these are known as ‘cover’ versions. We describe a system that attempts to identify such a relationship between music audio recordings. To overcome variability in tempo,...
متن کاملSparse Codebook Model of Local Structures for Retrieval of Focal Liver Lesions Using Multiphase Medical Images
Characterization and individual trait analysis of the focal liver lesions (FLL) is a challenging task in medical image processing and clinical site. The character analysis of a unconfirmed FLL case would be expected to benefit greatly from the accumulated FLL cases with experts' analysis, which can be achieved by content-based medical image retrieval (CBMIR). CBMIR mainly includes discriminated...
متن کامل